Overview

Dataset Statistics

Number of Variables 16
Number of Rows 22553
Missing Cells 36116
Missing Cells (%) 10.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 232.9 MB
Average Row Size in Memory 10.6 KB
Variable Types
  • Numerical: 3
  • Categorical: 12
  • DateTime: 1

Dataset Insights

job_id is uniformly distributed Uniform
requirement_summary has 240 (1.06%) missing values Missing
benefit_summary has 491 (2.18%) missing values Missing
user_keywords has 12630 (56.0%) missing values Missing
industry has 22553 (100.0%) missing values Missing
job_id is skewed Skewed
experience is skewed Skewed
title has a high cardinality: 16347 distinct values High Cardinality
description has a high cardinality: 20587 distinct values High Cardinality
requirement_summary has a high cardinality: 17333 distinct values High Cardinality
benefit_summary has a high cardinality: 10619 distinct values High Cardinality
user_keywords has a high cardinality: 8058 distinct values High Cardinality
soft_skills has a high cardinality: 13918 distinct values High Cardinality
technical_skills has a high cardinality: 19966 distinct values High Cardinality
industry has all distinct values Unique
  • 1
  • 2

Variables


job_id

numerical

Approximate Distinct Count 22553
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 352.4 KB
Mean 11276
Minimum 0
Maximum 22552
Zeros 1
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • job_id is uniformly distributed

Quantile Statistics

Minimum 0
5-th Percentile 1014.84
Q1 5525.24
Median 11163.24
Q3 16801.75
95-th Percentile 21311.75
Maximum 22552
Range 22552
IQR 11276.51

Descriptive Statistics

Mean 11276
Standard Deviation 6510.6346
Variance 4.2388e+07
Sum 2.5431e+08
Skewness 0
Kurtosis -1.2
Coefficient of Variation 0.5774
  • job_id is not normally distributed (p-value 5.684397892719414e-15)

account_id

numerical

Approximate Distinct Count 3274
Approximate Unique (%) 14.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 352.4 KB
Mean 1642.2143
Minimum 0
Maximum 3304
Zeros 2
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • account_id is skewed right (γ1 = 0.0227)

Quantile Statistics

Minimum 0
5-th Percentile 164
Q1 804
Median 1645
Q3 2500
95-th Percentile 3154
Maximum 3304
Range 3304
IQR 1696

Descriptive Statistics

Mean 1642.2143
Standard Deviation 971.1031
Variance 943041.2309
Sum 3.7037e+07
Skewness 0.02273
Kurtosis -1.2454
Coefficient of Variation 0.5913

title

categorical

Approximate Distinct Count 16347
Approximate Unique (%) 72.5%
Missing 0
Missing (%) 0.0%
Memory Size 2.1 MB

Length

Mean 31.0606
Standard Deviation 13.8464
Median 28
Minimum 3
Maximum 117

Sample

1st row Project Coordinato...
2nd row Supply Chain Speci...
3rd row Food and Beverage ...
4th row Quality Assurance ...
5th row People Operations ...

Letter

Count 601183
Lowercase Letter 508528
Space Separator 70128
Uppercase Letter 92655
Dash Punctuation 6759
Decimal Number 7625
  • title contains many words: 6348 words
  • The largest value (manager) is over 2.11 times larger than the second largest value (engineer)

description

categorical

Approximate Distinct Count 20587
Approximate Unique (%) 91.3%
Missing 2
Missing (%) 0.0%
Memory Size 133.0 MB
  • The largest value (<p><img title="Contact Center Specialist At-Home" alt="" src="https://workablehr.s3.amazonaws.com/uploads/photos/21009/1da096cadcd79715f83334af9180859f.jpg" style="display: block; margin: auto;"></p><p><br></p><p>Anomaly Squared is growing again and if you’re looking to join a fun, laid back environment that provides opportunities for personal and professional growth, please consider applying. A² is an innovative customer contact center that offers a launching point for all employees to advance on their career path, either within the company walls, or elsewhere in the future.</p><p><br></p><p><strong>Position Description:</strong></p><p>We are seeking <strong>At-Home</strong> Contact Center Specialists. You would be responsible for qualifying callers for programs, products or services that our clients offer through outbound and inbound calls. We work with some of the best and most recognized companies in their industries, so professionalism and excellent communication skills are a must! Full-time hours with an option for part-time. </p><p>Benefits of this position include personal development such as:</p><p>• Building confidence</p><p>• Gaining and improving communication skills</p><p>• Learning to overcome objections</p><p><br></p><p><strong>Wage</strong></p><p>$10.00 per hour with semi-annual reviews</p>) is over 1.94 times larger than the second largest value (<p>LabCorp Employer Services is a leading provider of biometric testing services, population health and comprehensive workforce wellness strategies. These services are performed by a network of LabCorp Employer Services personnel located throughout the country.</p><p>LabCorp Employer Services is seeking medical professionals to provide testing services at events. Once hired, our staff have the ability to assign themselves to events in their area by utilizing our scheduling system. In addition, we provide pre-event comprehensive training on LES protocols.</p><p>Testing services include biometric screenings, COVID-19 PCR testing, COVID-19 point of care antigen testing, and temperature checks. Our staff are responsible for the successful setup, execution, and breakdown of events while providing exceptional customer service to participants. </p><p><strong>TO APPLY CLICK ON THE LINK BELOW:</strong></p><p><a href="https://www.shiftboard.com/LabCorpEmployerServices/register.html" rel="nofollow noreferrer noopener" class="external">https://www.shiftboard.com/LabCorpEmployerServices/register.html</a></p><p><strong>TO BE CONSIDERED FOR THIS ROLE YOU MUST COMPLETE AN APPLICATION ON OUR WEBSITE!</strong><br></p>)

Length

Mean 2353.973
Standard Deviation 1480.1524
Median 2097
Minimum 0
Maximum 20637

Sample

1st row


...

2nd row

Droplette is re...

3rd row

Fotogra...

4th row

Acolad ...

5th row

The People Oper...

Letter

Count 41974958
Lowercase Letter 40593326
Space Separator 7323874
Uppercase Letter 1381632
Dash Punctuation 124405
Decimal Number 173872
  • description contains many words: 129801 words

requirement_summary

categorical

Approximate Distinct Count 17333
Approximate Unique (%) 77.7%
Missing 240
Missing (%) 1.1%
Memory Size 52.2 MB
  • The largest value () is over 19.17 times larger than the second largest value (<ul> <li>High School Diploma or GED is required</li> <li>Great Verbal and Written Communication Skills</li> <li>Working Knowledge of Windows Based Operating Systems including Google Chrome</li> <li>Can Demonstrate Product Knowledge Once Nesting Period is Complete</li> <li>Ability to Adapt in a Fast Changing Environment</li> <li>Own a computer at home</li> <li>Internet access</li> </ul><p><strong>NOTE:</strong> We are accepting online applications only. Unfortunately, there is no time available to handle additional phone call inquiry's for the limited number of spaces we have open.</p>)

Length

Mean 1094.2062
Standard Deviation 962.5506
Median 901.5
Minimum 0
Maximum 10930

Sample

1st row
2nd row
3rd row
  • An averag...
4th row
  • Universit...
5th row
  • The ideal...

Letter

Count 18968794
Lowercase Letter 18254574
Space Separator 3244749
Uppercase Letter 714220
Dash Punctuation 54347
Decimal Number 67252
  • requirement_summary contains many words: 73242 words

benefit_summary

categorical

Approximate Distinct Count 10619
Approximate Unique (%) 48.1%
Missing 491
Missing (%) 2.2%
Memory Size 28.9 MB
  • The largest value () is over 23.14 times larger than the second largest value (<p><i>Anomaly Squared is an AA/EEO employer. PTO and Healthcare benefits are available for qualifying employees. Please go to </i><a href="http://www.anomalysquared.com/" rel="nofollow noreferrer noopener" class="external"></a><i><a href="http://www.AnomalySquared.com" rel="nofollow noreferrer noopener" class="external">www.AnomalySquared.com</a></i><i> to learn more about our growing company!</i></p>)

Length

Mean 635.7125
Standard Deviation 714.0978
Median 432
Minimum 0
Maximum 8298

Sample

1st row
2nd row

We are committe...

3rd row
  • Health Ca...
4th row

Amplexor offers...

5th row

In return, we c...

Letter

Count 10675335
Lowercase Letter 10193847
Space Separator 1843906
Uppercase Letter 481488
Dash Punctuation 35667
Decimal Number 138449
  • benefit_summary contains many words: 33258 words

created_at

datetime

Distinct Count 22467.364
Approximate Unique (%) 99.6%
Missing 0
Missing (%) 0.0%
Memory Size 176.4 KB
Minimum 2021-08-01 00:23:57.429967
Maximum 2021-12-31 23:33:53.796760

user_keywords

categorical

Approximate Distinct Count 8058
Approximate Unique (%) 81.2%
Missing 12630
Missing (%) 56.0%
Memory Size 1.6 MB

Length

Mean 104.9213
Standard Deviation 92.5201
Median 80
Minimum 6
Maximum 1011

Sample

1st row ['project manageme...
2nd row ['food and beverag...
3rd row ['human resources'...
4th row ['Lab Automation',...
5th row ['javascript', 'us...

Letter

Count 729469
Lowercase Letter 666969
Space Separator 93782
Uppercase Letter 62500
Dash Punctuation 1208
Decimal Number 1764
  • user_keywords contains many words: 7560 words
  • The largest value (sales) is over 1.51 times larger than the second largest value (management)

employment_type

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.0%
Missing 21
Missing (%) 0.1%
Memory Size 1.6 MB
  • The largest value (Full-time) is over 18.97 times larger than the second largest value (Part-time)

Length

Mean 8.9548
Standard Deviation 0.3256
Median 9
Minimum 5
Maximum 9

Sample

1st row Full-time
2nd row Full-time
3rd row Full-time
4th row Full-time
5th row Full-time

Letter

Count 180286
Lowercase Letter 157754
Space Separator 0
Uppercase Letter 22532
Dash Punctuation 21484
Decimal Number 0
  • The top 2 categories (Full-time, Part-time) take over 50.0%
  • The largest value (fulltime) is over 18.97 times larger than the second largest value (parttime)

function

categorical

Approximate Distinct Count 37
Approximate Unique (%) 0.2%
Missing 21
Missing (%) 0.1%
Memory Size 1.7 MB

Length

Mean 12.2522
Standard Deviation 5.2709
Median 11
Minimum 5
Maximum 22

Sample

1st row Project Management
2nd row Supply Chain
3rd row Management
4th row Quality Assurance
5th row Human Resources

Letter

Count 266076
Lowercase Letter 233554
Space Separator 8804
Uppercase Letter 32522
Dash Punctuation 0
Decimal Number 0

industry

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1.5 MB

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row nan
2nd row nan
3rd row nan
4th row nan
5th row nan

Letter

Count 67659
Lowercase Letter 67659
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • industry has words of constant length

experience

numerical

Approximate Distinct Count 11
Approximate Unique (%) 0.0%
Missing 80
Missing (%) 0.4%
Infinite 0
Infinite (%) 0.0%
Memory Size 351.1 KB
Mean 0.3654
Minimum 0
Maximum 1
Zeros 262
Zeros (%) 1.2%
Negatives 0
Negatives (%) 0.0%
  • experience is skewed right (γ1 = 0.9035)

Quantile Statistics

Minimum 0
5-th Percentile 0.1
Q1 0.2
Median 0.3
Q3 0.5
95-th Percentile 0.8
Maximum 1
Range 1
IQR 0.3

Descriptive Statistics

Mean 0.3654
Standard Deviation 0.1966
Variance 0.03866
Sum 8211
Skewness 0.9035
Kurtosis 0.9614
Coefficient of Variation 0.5382
  • experience is not normally distributed (p-value 3.124929078562578e-12)
  • experience has 194 outliers

education

categorical

Approximate Distinct Count 4
Approximate Unique (%) 0.0%
Missing 21
Missing (%) 0.1%
Memory Size 1.8 MB
  • The largest value (Bachelor's Degree) is over 2.9 times larger than the second largest value (High School or equivalent)

Length

Mean 18.8394
Standard Deviation 3.6434
Median 17
Minimum 9
Maximum 25

Sample

1st row Bachelor's Degree
2nd row Bachelor's Degree
3rd row Bachelor's Degree
4th row Bachelor's Degree
5th row Bachelor's Degree

Letter

Count 374377
Lowercase Letter 329574
Space Separator 33413
Uppercase Letter 44803
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Bachelor's Degree, High School or equivalent) take over 50.0%

collar_color

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 21
Missing (%) 0.1%
Memory Size 1.5 MB
  • The largest value (White) is over 6.66 times larger than the second largest value (Blue)

Length

Mean 4.8695
Standard Deviation 0.3369
Median 5
Minimum 4
Maximum 5

Sample

1st row White
2nd row White
3rd row White
4th row White
5th row White

Letter

Count 109719
Lowercase Letter 87187
Space Separator 0
Uppercase Letter 22532
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (White, Blue) take over 50.0%
  • The largest value (white) is over 6.66 times larger than the second largest value (blue)

soft_skills

categorical

Approximate Distinct Count 13918
Approximate Unique (%) 61.8%
Missing 18
Missing (%) 0.1%
Memory Size 3.0 MB
  • The largest value ([]) is over 4.79 times larger than the second largest value (['communication'])

Length

Mean 72.4483
Standard Deviation 66.6545
Median 59
Minimum 2
Maximum 623

Sample

1st row ['attention to det...
2nd row ['interdisciplinar...
3rd row []
4th row ['problem solving ...
5th row ['self-starter', '...

Letter

Count 1187476
Lowercase Letter 1187476
Space Separator 115947
Uppercase Letter 0
Dash Punctuation 12318
Decimal Number 22
  • soft_skills contains many words: 2450 words
  • The largest value (communication) is over 2.2 times larger than the second largest value (detail)

technical_skills

categorical

Approximate Distinct Count 19966
Approximate Unique (%) 88.6%
Missing 18
Missing (%) 0.1%
Memory Size 6.1 MB
  • The largest value ([]) is over 5.65 times larger than the second largest value (['windows based operating systems', 'google chrome'])

Length

Mean 203.4527
Standard Deviation 183.2636
Median 153
Minimum 2
Maximum 1757

Sample

1st row ['publication', 'h...
2nd row ['needle - free pl...
3rd row ['forecasting', 'f...
4th row ['iso 9001', 'six ...
5th row ['hr software', 'h...

Letter

Count 3297396
Lowercase Letter 3297396
Space Separator 452402
Uppercase Letter 0
Dash Punctuation 8479
Decimal Number 12066
  • technical_skills contains many words: 18129 words
  • The largest value (management) is over 2.15 times larger than the second largest value (data)

Interactions

Correlations

Missing Values